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(57) Abstract 

The invention relates to nucleic acids which contain particularly a nucleotide sequence extending from the extremity consti- 
tuted by the nucleotide at position (1) to the extremity constituted by the nucleotide at position (1211) represented on the figure, to 
the polypeptides coded by said nucleic acids. The polypeptides of the invention can be used for the diagnosis of tuberculosis, and 
can also be part of the active principle in the preparation of a vaccine against tuberculosis. 
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Mycobacterium polypeptides and nucleic acids encoding them for diagnosis 

and control of tuberculosis 



The invention relates to polypeptides and 
peptides, particularly recombinant polypeptides and 
peptides, which can be used for the diagnosis of 
tuberculosis. The invention also relates to a process 
for preparing the above-said polypeptides and peptides, 
which are in a state of biological purity such that 
they can be used as part of the active principle in the 
preparation of vaccines against tuberculosis. 

It also relates to nucleic acids coding for said 
polypeptides and peptides. 

Furthermore, the invention relates to the in vitro 
diagnostic methods and kits using the above-said 
polypeptides and peptides and to the vaccines 
containing the above-said polypeptides and peptides as 
active principle against tuberculosis. 

By "recombinant polypeptides or peptides" it is to 
be understood that it relates to any molecule having a 
polypeptidic chain liable to be produced by genetic 
engineering, through transcription and translation, of 
a corresponding DNA sequence under the control of 
appropriate regulation elements within an efficient 
cellular host. Consequently, the expression 
"recombinant polypeptides" such as is used herein does 
not exclude the possibility for the polypeptides to 
comprise other groups, such as glycosylated groups. 

The term "recombinant" indeed involves the fact 
that the polypeptide has been produced by genetic 
engineering, particularly because it results from the 
expression in a cellular host of the corresponding 
nucleic acid sequences which have previously been 
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introduced into the ejcpression vector used in said 
host. 

Nevertheless, it must be understood that this 
expression does not exclude the possibility for the 
polypeptide to be produced by a different process, for 
instance by classical chemical synthesis according to 
methods used in the protein synthesis or by proteolytic 
cleavage of larger molecules. 

The expression "biologically pure" or "biological 
purity" means on the one hand a grade of purity such 
that the recombinant polypeptide can be used for the 
production of vaccinating compositions and on the other 
hand the absence of contaminants, more particularly of 
natural contaminants. 

Tuberculosis remains a major disease in developing 
countries. The situation is dramatic in some countries, 
particularly where high incidence of tuberculosis among 
AIDS patients represents a new source of dissemination 
of the disease. 

Tuberculosis is a chronic infectious disease in 
which cell-mediated immune mechanisms play an essential 
role both for protection against and control of the 
disease . 

Despite BCG vaccination, and some effective drugs, 
tuberculosis remains a major global problem. Skin 
testing with tuberculin ppd (protein-purified 
derivative) largely used for screening of the disease 
is poorly specific, due to cross reactivity with other 
pathogenic or environmental saprophytic mycobacteria. 

Moreover, tuberculin PPD when used in serological 
tests (ELISA) does not permit discrimination between 
patients who have been vaccinated by BCG, or those who 
have been primo-infected, from those who are developing 
evolutive tuberculosis and for whom an early and rapid 
diagnosis would be necessary. 
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A protein with a molecular weight of 32-kDa has 
already been purified from zinc deficient bovis BCG 
culture filtrate. This protein was identified as 
antigen 85A (De Bruyn J. et al., 1987, "Purification, 
partial characterization and identification of a 32-kDa 
protein antigen of Mycobacterium bovis BCG " Microb. 
Pathogen. 2:351). Its NHg-terminal amino acid sequence 
(Phe-Ser-Arg-Pro-Gly-Leu) is identical to that reported 
for the a-antigen (antigen 85B) protein purified from 
Mi bovis BCG (Wiker, H.G. et al., 1986, "MPBSg, a 
widely cross-reacting protein of Mycobacterium bovis 
BCG" Int. Arch. Allergy Appl. Immunol. 81:307). The 
antigen 85-complex is present among different strains 
of mycobacteria (De Bruyn J. et al., 1989, "Effect of 
zinc deficiency of the appearance of two immunodominant 
protein antigens (32-kDa and 65-kDa) in culture 
filtrates of Mycobacteria" J. Gen Microbiol. 135:79) . 
It is secreted by living bacilli as a predominant 
protein in normal Sauton culture filtrate and could be 
useful in the serodiagnosis of tuberculosis (Turneer M. 
et al., 1988, "Humoral immune response in human 
tuberculosis: immunoglobulins G, A and M directed 
against the purified P32 protein antigen of 
Mycobacterium bovis bacillus Calmette-Guerin " J. Clin. 
Microbiol. 26:1714) and leprosy (Rumschlag H.S. et al., 
1988, "Serological response of patients with 
lepromatous and tuberculosis leprosy to 30-, 3i- and 
32-kilodalton antigens of Mycobacterium tuberculosis " 
J. Clin. Microbiol. 26:2200). Furthermore, the 32-kDa 
protein-induced specific lymphoproliferation and 
inter feron-7(IFN-7) production in peripheral blood 
leucocytes from tuberculosis (Huygen K. et al., 1988, 
"Specific lymphoproliferation, 7-interferon production 
and serum immunoglobulin G directed against a purified 
32-kDa mycobacterial antigen (P32) in patients with 
active tuberculosis" Scand. J, Immunol. 27:187), and 
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leprosy patients and from PPD- and lepromin -positive 
healthy sxibjects. Recent findings indicate that the 
amount of 32 kDa protein induced IFN-7 in BCG- 
sensitized mouse spleen cells is under probable H-2 
control (Huygen K. et al, 1989, "H-2-linked control of 
in vitro 7 interferon production in response to a 32- 
kilodalton antigen (P32) of Mycobacterium bovis 
bacillus Calmett e-Guer in " Infect, Imm. 56:3196). 
Finally, the high affinity of mycobacteria for 
fibronectin is related to proteins of the antigen 85- 
complex (Abou-Zeid C- et al., 1988, "Characterization 
of f ibronectin-binding antigens released by 
Mycobacterium tuberculosis and Mycobacterium bovis BCG " 
Infect, Imm. 56:3046). 

Wiker et al. (Wiker H.G. et al., 1990, "Evidence 
for three separate genes encoding the proteins of the 
mycobacterial antigen 85 complex" Infect. Immun. 
58:272) showed recently that the antigens 85A, B and C 
isolated from M^ bovis BCG culture filtrate present a 
few amino acid replacements in their NH2 terminal 
region strongly suggesting the existence of multiple 
genes coding for these proteins. But, the data given 
for the antigen 85C of M^ bovis BCG are insufficient to 
enable its unambiguous identification as well as the 
characterization of its structural and functional 
elements. 

The gene encoding the 85A antigen from 
Mycobacterium tuber cul os i s has been described 
(Borremans L. et al., 1989, "Cloning, sequence 
detemination and expression of a 32-kilodalton protein 
gene of Mycobacterium tuberculos is " Infect. Immun. 
57:3123) which presented 77.5% homology at the DNA 
level within the coding region with the a-antigen gene 
(85B gene of bovis BCG , substrain Tokyo) (Matsuo K. 
et al., 1988, "Cloning and expression of the 
Mycobacterium bovis BCG gene for extracellular a- 
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antigen" J. Bacteriol. 170:3847) • Moreover, recently a 
corresponding 32-kDa protein genomic clone from a Agtll 
BCG library (prepared from strain bovis BCG 1173P2 ) 
was isolated and sequenced. The complete sequence of 
this gene is identical with that from the 85A gene of 
Mycobacterium tuberculosis except for a single silent 
nucleotide change (De Wit L. et al., 1990, "Nucleotide 
sequence of the 32 kDa-protein gene (antigen 85A) of 
Mycobacterium bovis BCG " Nucl. Ac. Res. 18:3995). Thus, 
it was likely, but not demonstrated, that the genome of 
M^ bovis BCG contained at least two genes coding for 
antigen 85A and 85B respectively. As to the genome of 
the Mycobacterium tuberculosis and M^ bovis, nothing 
was proved as to the existence of new genes, besides 
the genes coding respectively for 85A and 85B. 

An aspect of the invention is to provide a new 
family of nucleic acids coding for new proteins and 
polypeptides which can be used for the detection and 
control of tuberculosis. 

Another aspect of the invention is to provide 
nucleic acids coding for the peptidic chains of 
biologically pure recombinant polypeptides which enable 
their preparation on a large scale. 

Another aspect of the invention is to provide 
antigens which can be used 

- in serological tests as an in vitro rapid diagnostic 
test for tuberculosis or in skin test, 

- or as immunogenic principle of a vaccine. 

Another aspect of the invention is to provide a 
rapid in vitro diagnostic means for tuberculosis, 
enabling it to discriminate between patients suffering 
from an evolutive tuberculosis from those who have been 
vaccinated against BCG or who have been primo-infected. 

Another aspect of the invention is to provide 
nucleic probes which can be used as in vitro diagnostic 
reagents for tuberculosis as well as in vitro 
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diagnostic reagents for identifying tuberculosis 
from other strains of mycobacteria. 

The nucleic acids of the invention 

* contain a nucleotide sequence extending from the 
extremity constituted by the nucleotide at position (1) 
to the extremity constituted by the nucleotide at 
position (149) represented on Figure 1,. 

* or contain one at least of the nucleotide sequences 
coding for the following peptides or polypeptides: 

- the one extending from the extremity constituted 
by amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted 
by amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- SQSNGQNY, or 

- PMVQIPRLVA, or 

- GLTUITNQTFRDTYAADGGRNG, or 

- PPAAPAAPAA, 

* or contain nucleotidic sequences: 

- hybridizing with the above-mentioned nucleotide 
sequences, or their complements, 

- complementary to the above-mentioned nucleotide 
sequences, or 

- which are the above-mentioned nucleotide 
sequences wherein T can be replaced by U, 

* or are constituted by the above-mentioned nucleotide 
sequences . 

SQSNGQNY is a sequence corresponding to the one 
extending from position 84 to position 91 of 85C 
sequence represented on Figure 1. 

PMVQIPRLVA is a sequence corresponding to the one 
extending from position 191 to position 200 of 85C 
sequence represented on Figure 1. 
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GLTLRTNQTFRDTYAADGGRNG is a sequence corresponding 
to the one extending from position 229 to position 250 
of 85C sequence represented on Figure 1. 

PPAAPAAPAA is a sequence corresponding to the one 
extending from position 285 to position 294 of BSC 
sequence represented on Figure 1. 

The hybridization takes place under the following 
conditions: 

- hybridization and wash medium: 

* a preferred hybridization medium contains about 
3 X SSC [SSC = 0,15 M sodium chloride, 0.015 M sodium 
citrate, pH 7] about 25 mU of phosphate buffer pH 7.1, 
and 20% deionized formamide, 0.02% Ficoll, 0,02% BSA, 
0.02% polyvinylpyrrolidone and about 0.1 mg/ml sheared 
denatured salmon sperm DNA, 

* a preferred wash medium contains about 3 x SSC, 
about 25 mM phosphate buffer, pH 7.1 and 20% deionized 
formamide; 

- hybridization temperature (HT) and wash temperature 
(WT) are between 45 'C and 65 'C; 

for the nucleotide sequence extending from the 
extremity constituted by the nucleotide at position (1) 
to the extremity constituted by the nucleotide at 
position (149) represented on Figure 1: 
HT ^ WT = 65 "C 

for the nucleic acids of the invention defined by coded 
polypeptides X - Y: i.e. 

the sequence extending from the extremity 
constituted by the amino acid at position (X) to the 
extremity constituted by the amino acid at position (Y) 
represented on Figure 1, 

the sequence extending from the extremity 
constituted by the amino acid at position (-46) to the 
extremity constituted by the amino acid at position 
(-1) represented on Figure 1, 
HT WT = 65'C 
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the sequence extending from the extremity 
constituted by the amino acid at position (-21) to the 
extremity constituted by the amino acid at position 
(^1) represented on Figure 1, 
HT = WT ^ SO'C 

for the nucleic acids defined by coded polypeptides 
represented by their sequence: 

• SQSNGQNY HT = WT = 45*0 

. PMVQIPRLVA HT = WT = 55-C 

. GLTmTNQTFRDTYAADGGRNG HT = WT 65*0 

• PPAAPAAPAA HT = WT = 65**C. 

The above-mentioned temperatures are to be 
expressed as approximately ± 

Advantageous nucleic acids of the invention 
contain at least one of the following nucleotide 
sequences: 

- the one extending from the extremity constituted by 
the nucleotide at position (150) to the extremity 
constituted by the nucleotide at position (287) on 
Figure 1, 

- the one extending from the extremity constituted by 
the nucleotide at position (224) to the extremity 
constituted by the nucleotide at position (287) on 
Figure 1, 

- the one extending from the extremity constituted by 
the nucleotide at position (537) to the extremity 
constituted by the nucleotide at position (560) on 
Figure 1, 

- the one extending from the extremity constituted by 
the nucleotide at position (858) to the extremity 
constituted by the nucleotide at position (887) on 
Figure 1^ 

- the one extending from the extremity constituted by 
the nucleotide at position (972) to the extremity 
constituted by the nucleotide at position (1037) on 
Figure 1, 
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- the one extending from the extremity constituted by 
the nucleotide at position (1140) to the extremity 
constituted by the nucleotide at position (1169) on 
Figure 1, 

or contain nucleotidic sequences: 

- hybridizing with the above-mentioned nucleotide 
sequences , or 

- complementary to the above-mentioned nucleotide 
sequences , or 

which are the above-mentioned nucleotide 
sequences wherein T can be replaced by U, 
or are constituted by the above-mentioned nucleotide 
sequences . 

The hybridization takes place under the following 
conditions : 

- hybridization and wash medium are as defined above ; 

- hybridization temperature (HT) and wash temperature 
(WT) for the nucleic acids of the invention defined by 
X - Y: i.e« by the sequence extending from the 
extremity constituted by the nucleotide at position (X) 
to the extremity constituted by the nucleotide at 
position (Y) represented on Figure 1: 

(150) - (287) HT « WT = 65'C 

(224) - (287) HT = WT « 60'C 

(537) - (560) HT = WT = 45'C 

(858) - (887) HT = WT = 55-^0 

(972) - (1037) HT = WT = 65*^C 

(1140) - (1169) HT = WT = 65'C. 

An advantageous group of nucleic acids of the 
invention contains the nucleotide sequence coding for 
the following peptide: 

SQSNGQNY 

and possibly containing the nucleotide sequence coding 
for the following peptide: 

FSRPGLPVEYLQVP 
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and liable to hybridize with the folloving nucleotide 
sequence: 

CGGCTGGGAC ( or T) ATCAACAC CCCGGC 

and liable to hybridize neither with 

GCCTGCGGCAAGGCCGGTTGCCAG 

nor with 

GCCTGCGGTAAGGCTGGCTGCCAG 
nor with 

GCCTGCGGCAAGGCCGGCTGCACG 

or are constituted by the above-mentioned hybridizing 
nucleotide sequences. 

The above-mentioned hybridization can take place 
when the hybridization and wash medium is as indicated 
above; and the hybridization and wash temperature is 
52-C. 

The expression "not liable to hybridize with" 
means that the nucleic acid molecule of the invention 
does not contain a stretch of nucleotide hybridizing at 
52 in the above defined medium with the three probes 
defined above • 

Advantageous nucleic acids of the invention 
contain one at least of the above-mentioned nucleotide 
sequences or are constituted by the above-mentioned 
nucleotide sequences and besides contain an open 
reading frame coding for a polypeptide: 

- liable to react selectively with human sera from 
tuberculosis patients and particularly patients 
developing an evolutive tuberculosis, 

- or liable to be recognized by antibodies also 
recognizing the amino acid sequence extending from the 
extremity constituted by amino acid at position (1) to 
the extremity constituted by amino acid at position 
(294) represented on Figure 1, 

- or liable to generate antibodies recognizing the 
amino acid sequence extending from the extremity 
constituted by amino acid at position (1) to the 
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extremity constituted by amino acid at position (294) 
represented on Figure 1. 

The recognition of the above-mentioned sequence of 
the 294 amino acids (or of the polypeptides of the 
invention) by the abovesaid antibodies means that the 
abovesaid sequence forms a complex with one of the 
above-mentioned antibodies . 

Forming a complex between the antigen (i.e. the 
sequence of 294 amino acids or any polypeptide of the 
invention) and the antibodies and detecting the 
existence of a formed complex can be done according to 
classical techniques (such as the one using a tracer 
labeled with radioactive isotopes or an enzyme) . 

Hereafter is given, in a non-limitative way, a 
process for testing the selective reaction between the 
antigen and human sera from tuberculosis patients and 
particularly patients developing an evolutive 
tuberculosis . 

This test is an immunoblotting (Western blotting) 
analysis, in the case where the polypeptides of the 
invention are obtained by recombinant techniques. This 
test can also be used for polypeptides of the invention 
obtained by a different preparation process. After 
sodium dodecyl sulfate - polyacrylamide gel 
electrophoresis, polypeptides of the invention are 
blotted onto nitrocellulose membranes (Hybond C, 
(Amersham)) as described by Towbin H. et al., 1979, 
"Electrophoretic transfer of proteins from 
polyacrylamide gels to nitrocellulose sheets: procedure 
and some applications" Proc. Natl. Acad. Sci. USA 
76:4350-4354. The expression of polypeptides of the 
invention fused to )3-galactosidase in coli Y1089, is 
visualized by the binding of a polyclonal rabbit anti- 
antigen 85 serum (1:1,000) or by using a monoclonal 
anti-)S-galactosidase antibody (Promega) . The secondary 
antibody (alkaline phosphatase anti-rabbit 
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immunoglobulin G and anti-mouse alkaline phosphatase 
immunoglobulin G conjugates, respectively) is diluted 
as recommended by the supplier (Promega) . 

In order to identify selective recognition of 
polypeptides of the invention and of fusion proteins of 
the invention by human tuberculous sera, nitrocellulose 
sheets are incubated overnight with these sera (1:50) 
(after blocking aspecif ic protein-binding sites) . 
Reactive areas on the nitrocellulose sheets are 
revealed by incubation with peroxidase-conjugated goat 
anti-human immunoglobulin G antibody (Dakopatts, 
Copenhagen, Denmark) (1:200) for 4 h. After repeated 
washings, color reaction is developed by adding 
peroxidase substrate (a-chloronaphtol) (Bio-Rad 

Laboratories, Richmond, Calif.) in the presence of 
peroxidase and hydrogen peroxide. 

Advantageous nucleic acids of the invention 
contain or are constituted by one of the above- 
mentioned nucleotide sequences, contain an open reading 
frame and code for a mature polypeptide of about 30 to 
about 35 kD, and contain a sequence coding for a signal 
sequence. 

Advantageous nucleic acids of the invention 
contain one at least of the nucleotide sequences 
coding for the following polypeptides: 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
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constituted by amino acid at position (294) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (l) to the extremity constituted 
by amino acid at position (294) represented on Figure 
1, 

or contain nucleotidic sequences: 

- hybridizing with the above-mentioned nucleotide 
sequences , or 

- complementary to the above-mentioned nucleotide 
sequences , or 

which are the above-mentioned nucleotide 
sequences wherein T can be replaced by U, 
or are constituted by the above-mentioned nucleotide 
sequences . 

The hybridization takes place under the following 
conditions: 

- hybridization and wash medium are as above defined; 

- hybridization temperature (HT) and wash temperature 
(WT) for the nucleic acids of the invention defined by 
coded polypeptides X - Y: i.e. by the coded sequence 
extending from the extremity constituted by the amino 
acid at position (X) to the extremity constituted by 
the amino acid at position (Y) represented on Figure 1: 
(-46) - (-1) HT - WT = 65*0 

(-21) - (-1) HT - WT = 60'C 

(-46) - (294) HT = WT = 70'C 

(-21) - (294) HT = WT = 70**C 

(1) - (294) HT = WT = VO^C. 

Advantageous nucleic acids of the invention 
contain one at least of the following nucleotide 
sequences : 
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- the one extending from the extremity constituted by 
the nucleotide at position (150) to the extremity 
constituted by the nucleotide at position (287) 
represented on Figure 1^ or 

- the one extending from the extremity constituted by 
the nucleotide at position (224) to the extremity 
constituted by the nucleotide at position (287) 
represented on Figure 1, or 

- the one extending from the extremity constituted by 
the nucleotide at position (1) to the extremity 
constituted by the nucleotide at position (1169) 
represented on Figure 1, or 

- the one extending from the extremity constituted by 
the nucleotide at position (150) to the extremity 
constituted by the nucleotide at position (1169) 
represented on Figure 1^ or 

- the one extending from the extremity constituted by 
the nucleotide at position (224) to the extremity 
constituted by the nucleotide at position (1169) 
represented on Figure 1, or 

- the one extending from the extremity constituted by 
the nucleotide at position (288) to the extremity 
constituted by the nucleotide at position (1169) 
represented on Figure 1, 

- the one extending from the extremity constituted by 
the nucleotide at position (1) to the extremity 
constituted by the nucleotide at position (1211) 
represented on Figure 1, 

- the one extending from the extremity constituted by 
the nucleotide at position (150) to the extremity 
constituted by the nucleotide at position (1211) 
represented on Figure 1^ 

- the one extending from the extremity constituted by 
the nucleotide at position (224) to the extremity 
constituted by the nucleotide at position (1211) 
represented on Figure 1, 
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- the one extending from the extremity constituted by 
the nucleotide at position (288) to the extremity 
constituted by the nucleotide at position (1211) 
represented on Figure 1, 
or contain nucleotidic sequences: 

- hybridizing with the above-mentioned nucleotide 
sequences, or 

- complementary to the above-mentioned nucleotide 
sequences , or 

which are the above-mentioned nucleotide 
sequences wherein T can be replaced by u, 
or are constituted by one at least of the following 
nucleotide sequences. 

The hybridization takes place under the following 
conditions : 

- hybridization and wash medium are as above defined; 

- hybridization temperature (HT) and wash temperature 
(WT) for the nucleic acids of the invention defined for 
the nucleic acids of the invention defined by X - Y: 
i.e. by the sequence extending from the extremity 
constituted by the nucleotide at position (X) to the 
extremity constituted by the nucleotide at position (Y) 
represented on Figure 1: 

(150) - (287) HT = WT = 65*C 
(224) - (287) HT = WT - 60*C 
(150) - (1169) HT - WT = 70*C 
(1) - (1169) HT = WT = 70''C 
(224) - (1169) HT = WT = 70**C 
(288) - (1169) HT = WT = 70'C 

The invention relates also to the polypeptides 
coded by the nucleic acids of the invention above 
defined. 

Advantageous polypeptides of the invention contain 
at least one of the following amino acid sequences in 
their polypeptide chain: 
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- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, 

- or the one extending from the extremity constituted 
by amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- SQSNGQNY, or 

- PMVQIPRLVAr or 

- GLTLRTNQTFRDTYAADGGRNG, or 

- PPAAPAAPAA^ 

or are constituted by the above-mentioned polypeptide 
sequences . 

The invention also relates to polypeptides 
containing, in their polypeptide chain,, the following 
amino acid sequence: 

SQSNGQNY 

and possibly the amino acid sequence 

GWDINTPA 

and possibly the amino acid sequence 

FSRPGLPVEYLQVP 
and containing not the amino acid sequence 

ACGKAGCQ 

and not the amino acid sequence 

ACGKAGCT 

Advantageous polypeptides of the invention contain 
in their polypeptide chain the following amino acid 
sequences : 

SQSNGQNY 
GWDINTPA 
FSRPGLPVEYLQVP 
and one at least of the following amino acid sequences: 

PMVQIPRLVA^ 
GLTLRTNOtFRDT YAADGGRNG , 
PPAAPAAPAA, 
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and containing not the amino acid sequence 

ACGKAGCQ 

and not the amino acid sec[uence 

ACGKAGCT. 
The following polypeptides are new: 

SQSNGQNY, 
PMVQIPRLVA, 
GLTLRTNQTFRDT YAADGGRNG , 
PPAAPAAPAA. 

Advantageous polypeptides of the invention are 
liable to react selectively with human sera from 
tuberculosis patients and particularly patients 
developing an evolutive tuberculosis, 

or liable to be recognized by antibodies also 
recognizing the polypeptide sequence extending from the 
extremity constituted by amino acid at position (1) to 
the extremity constituted by amino acid at position 
(294) represented on Figure 1, 

or liable to generate antibodies recognizing the 
polypeptidic sequence extending from the extremity 
constituted by amino acid at position (1) to the 
extremity constituted by amino acid at position (294) 
represented on Figure 1. 

The invention also includes the peptidic sequences 
resulting from the modification by substitution and/or 
by addition and/ or by deletion of one or several amino 
acids in the above defined polypeptides and peptides in 
so far as this modification does not alter the 
following properties: 

selective reaction with human sera from tuberculosis 
patients and particularly patients developing an 
evolutive tuberculosis, 

and/or reaction with antibodies raised against the 
amino acid sequence extending from the extremity 
constituted by amino acid at position (1), to the 
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extremity constituted by amino acid at position (294) 
represented on Fig. 1. 

Advantageous polypeptides of the invention contain 
or are constituted by one of the above-mentioned 
polypeptide sequences, and are about 30 to about 35 JcD 
and are preceded by a signal peptide. 

Advantageous polypeptides of the invention contain 
in their polypeptide chain, one at least of the 
following amino acid sequences or are constituted by 
one of the following amino acid sequences: 

- the one extending from the extremity constituted by 
amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on Figure 
1, 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure !• 

It goes without saying that the free reactive 
functions which are present in some of the amino acids, 
which are part of the constitution of the polypeptides 
of the invention, particularly the free carboxyl groups 
which are carried by the groups Glu or Asp or by the 
C-terminal amino acid on the one hand and/or the free 
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NH2 groups carried by the N-terminal amino acid or by 
amino acid inside the peptidic chain, for instance Lys, 
on the other hand, can be modified insofar as this 
modification does not alter the above-mentioned 
properties of the polypeptide. 

The molecules which are thus modified are 
naturally part of the invention. The above-mentioned 
carboxyl groups can be acylated or ester ified. 

Other modifications are also part of the 
invention. Particularly, the amine or ester functions 
or both of terminal amino acids can be themselves 
involved in the bond with other amino acids. For 
instance, the N-terminal amino acid can be linked to a 
sequence comprising from 1 to several amino acids 
corresponding to a part of the C-terminal region of 
another peptide. 

The polypeptides according to the invention can be 
glycosylated or not, particularly in some of their 
glycosylation sites of the type Asn-X-Ser or Asn-X-Thr, 
X representing any amino acid. 

Other advantageous polypeptides of the invention 
consist in one of the following amino acid sequences: 

- the one extending from the extremity constituted by 
amino acid at position C"46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, 

- or the one extending from the extremity constituted 
by amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1. 

These polypeptides can be used as signal peptides, 
the role of which is to initiate the translocation of a 
protein from its site of synthesis to the membrane and 
which is excised during translocation. 

Advantageous polypeptides of the invention are the 
ones constituted by: 



wo 92/14823 



PCT/EP92/00268 



20 

- SQSNGQNY, 

- PMVQIPRLVA, 

- GLTLRTNQTFRDTYAADGGRNG, 

- PPAAPAAPAA, 

- the one extending from the extremity constituted by 
amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on Figure 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1^ 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure !• 

All these polypeptides are new* 

Other interesting polypeptides, which are common 
to the already known sequences of antigens 85A, 85B and 
85C of tuberculosis , M> bovis and kansasii are 
(see Figure 2 A) 

GWDINTPA, 

and 

FSRPGLPVE YLQVP • 
It is to be noted that the above-mentioned 
polypeptides are derived from the expression products 
of a DNA derived, as explained hereafter in the 
examples , 
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- from the nucleotide sequence coding for a protein of 
3 3-kDa secreted by Mycobacterium tuberculosis or 

- from the partial nucleotide sequence coding for a 
protein of 33-kDa secreted by bovis BCG, or 

- from related nucleotide sequences which will be 
hereafter designated by 85C genes. 

The invention also relates to the amino acid 
sequences constituted by the above-mentioned 
polypeptides and a protein or an heterologous sequence 
with respect to said polypeptide, said protein or 
heterologous sequence comprising for instance from 
about 1 to about 1000 amino acids. These amino acid 
sequences will be called fusion proteins. 

In an advantageous fusion protein of the 
invention, the heterologous protein is ^-galactosidase. 

The invention also relates to any recombinant 
nucleic acids containing at least one of the nucleic 
acids of the invention inserted in a heterologous 
nucleic acid. 

The invention relates more particularly to 
recombinant nucleic acid such as defined, in which the 
nucleotide sequence of the invention is preceded by a 
promoter (particularly an inducible promoter) under the 
control of which the transcription of said sequence is 
liable to be processed and possibly followed by a 
sequence coding for transcription termination signals. 

The invention also relates to the recombinant 
nucleic acids in which the nucleic acid sequences 
coding for the polypeptide of the invention and 
possibly the signal peptide, are recombined with 
control elements which are heterologous with respect to 
the ones to which they are normally associated with in 
the mycobacterial genome, more particularly, the 
regulation elements adapted to control their expression 
in the cellular host which has been chosen for their 
production. 
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Tlie invention also relates to recombinant vectors, 
particularly for cloning and/or expression, comprising 
a vector sequence, notably of the type plasmid, cosmid 
or phage DNA or virus DNA, and a recombinant nucleic 
acid of the invention, in one of the non-essential 
sites for its replication. 

According to an advantageous embodiment of the 
invention, the recombinant vector contains, in one of 
its non-essential sites for its replication, necessary 
elements to promote the expression of polypeptides 
according to the invention in a cellular host and 
notably a promoter recognized by the RNA polymerase of 
the cellular host, particularly an inducible promoter 
and possibly a sequence coding for transcription 
termination signals and possibly a signal sequence 
and/or an anchor sequence. 

According to another additional embodiment of the 
invention, the recombinant vector contains the elements 
enabling the expression by coli of a nucleic acid 
according to the invention inserted in the vector, and 
particularly the elements enabling the expression of 
the gene or part thereof of ^-galactosidase. 

The invention also relates to a cellular host 
which is transformed by a recombinant vector according 
to the invention, and containing the regulation 
elements enabling the expression of the nucleotide 
sequence coding for the polypeptide according to the 
invention in this host. 

The invention also relates to a cellular host 
chosen from among bacteria such as Ej^ coli, transformed 
by a vector as defined above, or chosen from among 
eukaryotic organism, such as CHO cells or insect cells, 
transfected by a vector as above defined. 

The invention relates to an expression product of 
a nucleic acid expressed by a transformed cellular host 
according to the invention. 
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The invention also relates to the use of any 
secreted polypeptide of the invention as a carrier 
antigen for foreign epitopes (epitopes of a polypeptide 
sequence heterologous with respect to the polypeptides 
of the invention) in the Mycobacterium bovis BOG 
vaccine strain. 

The Mycobacterium bovis BCG vaccine strain used 
can be available from Institut Pasteur (Paris) , under 
1173P2. 

The recombinant DNA comprising the nucleic acid 
coding for anyone of the polypeptides of the invention 
and the nucleic acid coding for any foreign epitopes as 
defined above, can contain the promoter sequence of 
said polypeptide of the invention, the signal sequence 
of said polypeptide, possibly the coding part of said 
polypeptide and the coding nucleic acid of the foreign 
epitope, said nucleic acid of the foreign epitope being 
for instance 

- either directly located after the signal sequence, 
and if the coding part of the the polypeptide of the 
invention is present, upstream from the coding part of 
the polypeptide of the invention, 

- or located downstream from the coding part of the 
polypeptide of the invention, 

- or located within the coding part of the polypeptide 
of the invention. 

The recombinant DNA as above defined can be 
transformed into the vaccine strain BCG where it leads 
to the expression and secretion of a recombinant 
protein antigen. 

From the nucleic acids of the invention, probes 
(i.e. cloned or synthetic oligonucleotides) can be 
inferred. 

These probes can be from 15 to the maximum number 
of nucleotides of the selected nucleic acids. The 
oligonucleotides can also be used either as 
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amplification primers in the PCR technique (PCR, Mullis 
and Faloona, Methods in Enzymology, vol. 155, p. 335, 
1987) to generate specific enzymatically amplified 
fragments and/or as probes to detect fragments 
amplified between bracketing oligonucleotide primers. 

The specificity of a PCR-assisted hybridization 
assay can be controlled at different levels. 

The amplification process or the detection process 
or both can be specific. The latter case giving the 
higher specificity is preferred. 

The invention also relates to a process for 
preparing a polypeptide according to the invention 
comprising the following steps: 

- the culture in an appropriate medium of a cellular 
host which has previously been transformed by an 
appropriate vector containing a nucleic acid according 
to the invention, 

- the recovery of the polypeptide produced by the 
abovesaid transformed cellular host from the abovesaid 
culture, and 

the purification of the polypeptide produced, 
eventually by means of immobilized metal ion affinity 
chromatography CIMAC) . 

The polypeptides of the invention can be prepared 
according to the classical techniques in the field of 
peptide synthesis. 

The synthesis can be carried out in homogeneous 
solution or in solid phase. 

For instance, the synthesis technique in 
homogeneous solution which can be used is the one 
described by Houbenweyl in the book entitled "Methode 
der organischen chemie" (Method of organic chemistry) 
edited by E. Wunsh, vol. 15-1 et II. THIEME, Stuttgart 
1974. 

The polypeptides of the invention can also be 
prepared in solid phase according to the methods 
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described by Atherton and Shepard in their book 
entitled "Solid phase peptide synthesis" (IRL Press, 
Oxford, New York, Tokyo, 1989) . 

The invention also relates to a process for 
preparing the nucleic acids according to the invention. 

A suitable method for chemically preparing the 
single-stranded nucleic acids (containing at most 100 
nucleotides of the invention) comprises the following 
steps : 

- DNA synthesis using the automatic j3-cyanoethyl 
phosphoramidite method described in Bioorganic 
Chemistry 4; 274-325, 1986. 

In the case of single-stranded DNA, the material 
which is obtained at the end of the DNA synthesis can 
be used as such. 

A suitable method for chemically preparing the 
double-stranded nucleic acids (containing at most 
100 bp of the invention) comprises the following steps: 

- DNA synthesis of one sense oligonucleotide using 
the automatic ^-cyanoethyl phosphoramidite method 
described in Bioorganic Chemistry 4; 274-325, 1986, and 
DNA synthesis of one anti-sense oligonucleotide using 
said above-mentioned automatic p-cyanoethyl 
phosphoramidite method, 

coiabining the sense and anti-sense 
oligonucleotides by hybridization in order to form a 
DNA duplex, 

- cloning the DNA duplex obtained into a suitable 
plasmid vector and recovery of the DNA according to 
classical methods, such as restriction enzyme digestion 
and agarose gel electrophoresis. 

A method for the chemical preparation of nucleic 
acids of length greater than 100 nucleotides - or base 
pairs, in the case of double-stranded nucleic acids - 
comprises the following steps: 
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- assembling of chemically synthesized 
oligonucleotides, provided at their ends with different 
restriction sites, the sequences of which are 
compatible with the succession of amino acids in the 
natural peptide, according to the principle described 
in Proc. Nat. Acad. Sci. USA 80; 7461-7465, 1983, 

- cloning the DNA thereby obtained into a suitable 
plasmid vector and recovery of the desired nucleic acid 
according to classical methods, such as restriction 
enzyme digestion and agarose gel electrophoresis. 

The invention also relates to antibodies 
themselves formed against the polypeptides according to 
the invention. 

It goes without saying that this production is not 
limited to polyclonal antibodies. 

It also relates to any monoclonal antibody 
produced by any hybridoma liable to be formed according 
to classical methods from splenic cells of an animal, 
particularly of a mouse or rat, immunized against the 
purified polypeptide of the invention on the one hand, 
and of cells of a myeloma cell line on the other hand, 
and to be selected by its ability to produce the 
monoclonal antibodies recognizing the polypeptide which 
has been initially used for the immunization of the 
animals. 

The invention also relates to any antibody of the 
invention labeled by an appropriate label of the 
enzymatic, fluorescent or radioactive type. 

The peptides which are advantageously used to 
produce antibodies, particularly monoclonal antibodies, 
are the following ones listed in Table 1 (referring to 
Figure 1) : 
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Table 1 






38 


HgN-DGLRAQDDYNGWDINTPAFE- 


■COOH 


57 


78 


HgN-TDWYQPSQSNGQNYTYKWET- 


•COOH 


97 


174 


HgN-ANSMWGPSSDPAWKRNDPMV- 


■COOH 


193 


204 


HgN-RIWVYCGNGTPSDLGGDNIP- 


•COOH 


223 


235 


HgN-NQTFRDTYAADGGRNGVFNF- 


•COOH 


254 


250 


HjN-GVFNFPPNGTHSWPYWNEQL- 


•COOH 


269 


275 


HzN-DIQHVLNGATPPAAPAAPAA- 


•COOH 


294 



The amino acid sequences are given in the one- 
letter code. 

Variations of the peptides listed in Table 1 are 
also possible depending on their intended use. For 
example, if the peptides are to be used to raise 
antisera, the peptides may be synthesized with an extra 
cysteine residue added. This extra cysteine residue is 
preferably added to the amino terminus and facilitates 
the coupling of the peptide to a carrier protein which 
is necessary to render the small peptide immunogenic. 
If the peptide is to be labeled for use in 
radioimmunoassays, it may be advantageous to synthesize 
the protein with a tyrosine attached to either the 
amino or carboxyl terminus to facilitate iodination. 
These peptides therefore possess the primary sequence 
of the peptides listed in Table 1 but with additional 
amino acids which do not appear in the primary sequence 
of the protein and whose sole function is to confer the 
desired chemical properties to the peptides. 

The invention also relates to any polypeptide 
according to the invention labeled by an appropriate 
label of the enzymatic, fluorescent, radioactive type. 

The invention also relates to a process for 
detecting in vitro antibodies related to tuberculosis 
in a human biological sample liable to contain them, 
this process comprising 
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- contacting the biological sample with a polypeptide 
or a peptide according to the invention under 
conditions enabling an in vitro immunological reaction 
between said polypeptide and the antibodies which are 
possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody 
complex which may be foxmed. 

Preferably, the biological medium is constituted 
by a human serum. 

The detection can be carried out according to any 
classical process. 

By way of example, a preferred method brings into 
play an immunoenzymatic process according to an ELISA, 
immunofluorescent, or radio immune logical (RIA) 
technique, or the equivalent ones. 

Such a method for detecting in vitro antibodies 
related to tuberculosis comprises for instance the 
following steps: 

- deposit of determined amounts of a polypeptidic 
composition according to the invention in the wells of 
a titration microplate, 

" introduction into said wells of increasing dilutions 
of the serum to be diagnosed, 
incubation of the microplate, 

- repeated rinsing of the microplate, 

- introduction into the wells of the microplate of 
labeled antibodies against the blood immunoglobulins, 

- the labeling of these antibodies being based on the 
activity of an enzyme which is selected from among the 
ones which are able to hydrolyze a substrate by 
modifying the absorption of the radiation of this 
latter at least at a given wavelength, 

- detection by comparison with a control standard of 
the amount of hydrolyzed substrate. 

The invention also relates to a process for 
detecting and identifying in vitro antigens of 
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tuberculosis in a human biological sample liable to 
contain them, this process comprising: 

- contacting the biological sample with an appropriate 
antibody of the invention under conditions enabling an 
in vitro immunological reaction between said antibody 
and the antigens of tuberculosis which are possibly 
present in the biological sample and the in vitro 
detection of the antigen/antibody complex which may be 
formed. 

Preferably, the biological medium is constituted 
by sputum, pleural effusion liquid, broncho-alveolar 
washing liquid, urine, biopsy or autopsy material. 

The invention also relates to an additional method 
for the in vitro diagnosis of tuberculosis in a patient 
liable to be infected by Mycobacterium tuberculosis 
comprising the following steps: 

- the possible previous amplification of the amount of 
the nucleotide sequences according to the invention, 
liable to be contained in a biological sample taken 
from said patient by means of a DNA primer set as 
defined above, 

- contacting the above-mentioned biological sample with 
a nucleotide probe of the invention, under conditions 
enabling the production of an hybridization complex 
formed between said probe and said nucleotide sequence, 

- detecting the abovesaid hybridization complex which 
has possibly been formed. 

To carry out the in vitro diagnostic method for 
tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis as defined above, the 
following necessary or kit can be used, with said 
necessary or kit comprising: 

- a determined amount of a nucleotide probe of the 
invention. 
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- advantageously the appropriate medium for creating an 
hybridization reaction between the sequence to be 
detected and the above mentioned probe ^ 

- advantageously, reagents enabling the detection of 
the hybridization complex which has been formed between 
the nucleotide sequence and the probe during the 
hybridization reaction • 

The invention also relates to an additional method 
for the in vitro diagnosis of tuberculosis in a patient 
liable to be infected by Mycobacterium tuberculosis 
comprising: 

- contacting a biological sample taken from a patient 
with a polypeptide or a peptide of the invention, under 
conditions enabling an in vitro immunological reaction 
between said polypeptide or peptide and the antibodies 
which are possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody 
complex which has possibly been formed. 

To carry out the in vitro diagnostic method for 
txiberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis , the following necessary or 
kit can be used, with said necessary or kit comprising: 

- a polypeptide or a peptide according to the 
invention, 

- reagents for making a medium appropriate for the 
immunological reaction to occur, 

- reagents enabling to detect the antigen/antibody 
complex which has been produced by the immunological 
reaction, with said reagents possibly having a label, 
or being liable to be recognized by a labeled reagent, 
more particularly in the case where the above-mentioned 
polypeptide or peptide is not labeled. 

The invention also relates to an additional method 
for the in vitro diagnosis of tuberculosis in a patient 
liable to be infected by M^ tuberculosis , comprising 
the following steps: 
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- contacting the biological sample with an appropriate 
antibody of the invention under conditions enabling an 
is vitro immunological reaction between said antibody 
and the antigens of M> tuberculosis which are possibly 
present in the biological sample and the in vitro 
detection of the antigen/antibody complex which may be 
formed. 

To carry out the in vitro diagnostic method for 
tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis , the following necessary or 
kit can be used, with said necessary or kit comprising: 

- an antibody of the invention, 

- reagents for making a medium appropriate for the 
immunological reaction to occur, 

reagents enabling the detection of the 
antigen/antibody complexes which have been produced by 
the immunological reaction, with said reagent possibly 
having a label or being liable to be recognized by a 
labeled reagent, more particularly in the case where 
the above-mentioned antibody is not labeled. 

An advantageous kit for the in vitro diagnosis of 
tuberculosis comprises: 

- at least a suitable solid phase system, e.g. a 
microtiter-plate for deposition thereon of the 
biological sample to be diagnosed in vitro , 

- a preparation containing one of the monoclonal 
antibodies of the invention, 

- a specific detection system for said monoclonal 
antibody, 

- appropriate buffer solutions for carrying out the 
immunological reaction between the biological sample 
and said monoclonal antibody on the one hand, and the 
bonded monoclonal antibodies and the detection system 
on the other hand. 

The invention also relates to a kit, as described 
above, also containing a preparation of one of the 
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polypeptides or peptides of the invention, with said 
antigen of the invention being either a standard (for 
quantitative determination of the antigen of M. 
tuberculosis which is sought) or a competitor, with 
respect to the antigen which is sought, for the kit to 
be used in a competition dosage process. 

The invention also relates to a necessary or kit 
for the diagnosis of prior exposure of a subject to 
tuberculosis , with said necessary or kit containing a 
preparation of at least one of the polypeptides or 
peptides of the invention, with said preparation being 
able to induce in vivo, after being intradermally 
injected to a subject, a delayed-type hypersensitivity 
reaction at the site of injection, in case the subject 
has had prior exposure to tuberculosis . 

This necessary or kit is called a skin test. 
The invention also relates to an immunogenic 
composition comprising a polypeptide or a peptide 
according to the invention, in association with a 
pharmaceutical ly acceptable vehicle. 

The invention also relates to a vaccine 
composition comprising among other immunogenic 
principles any one of the polypeptides or peptides of 
the invention or the expression product of the 
invention, possibly coupled to a natural protein or to 
a synthetic polypeptide having a sufficient molecular 
weight so that the conjugate is able to induce in vivo 
the production of antibodies neutralizing Mycobacterium 
tuberculosis , or induce in vivo a cellular immune 
response by activating tuberculosis antigen- 

responsive T cells. 

The peptides of the invention which are 
advantageously used as immunogenic principle are the 
ones mentioned in Table 1. 
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Other characteristics and advantages of the 
invention will appear in the following examples and the 
figures illustrating the invention. 

FIGURE LEGENDS 

Figure 1: 

Figure 1 represents the nucleotide and amino acid 
sequence of the 85C antigen containing region of M, 
tuberculosis , 

The previously identified 28-residue NH2-terminal 
amino acid sequence of the mature protein is underlined 
with a double line. One additional ATG codon, 
downstream from of the ATG at position 150 is 
underlined. Since the precise length of the signal 
sequence could not be determined, the option taken here 
represents the 4 6 amino acid signal peptide 
corresponding to ATG^jq. The putative signal peptide 
sequence is represented in italic capitals. The top 
drawing represents the sequencing strategy. Arrows 
indicate the direction of dideoxy-sequencing either in 
DNA subcloned as double stranded DNA in Blue Scribe 
M13+ or as single stranded DNA in the mpl8 M13 vector. 
The entire sequence was determined using synthetic 
oligonucleotides represented as gray boxes on the 
figure. 
Figure 2: 

Figure 2 represents the homology between known 
nucleotide and amino acid sequence of the antigen 85 
and the 85C antigen of tuberculosis ; 

A- Comparison of the DNA sequences of antigen 85A, 
B and C: 

DNA sequences have been aligned with the "Align" 
program which visualizes multiple alignments. In this 
presentation, sequence differences are outlined: 

(•) indicate identical residues ; (-) indicates a 
gap ; (any letter) indicates a substitution. 
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All the sequences are compared and aligned to that 
of the first line (gene 85A) . 

85A-TUB: DNA sequence from ML_ tuberculosis 
(Borremans L, et al., 1989, "Cloning, sequence 
determination and expression of a 32-kilodalton protein 
gene of Mycobacterium tuberculosis " Infect. Immun. 
57:3123) . 

85B-BCG: DNA sequence from a-antigen of 
Mycobacterium bovis (strain Tokyo) (Mat suo K. et al,, 
1988, "Cloning and expression of the Mycobacterium 
bovis BCG gene for extracellular a-antigen" J. 
Bacterid. 170:3847). 

85C-TUB: DNA sequence from antigen 85C from 
Mycobacterium tuberculosis (the present invention) . 

85B-KAN: DNA sequence from antigen 85B from M. 
kansasii (Matsuo K. et al., 1990, "Cloning and 
expression of the gene for cross-reactive a antigen of 
Mycobacterium kansasii " Infect. Immun. 58:550). 

85C-BCG: Partial DNA secjuence from Mycobacterium 
bovis BCG strain 1173P2 (the present invention) . This 
sequence was obtained from a cloned PCR amplified DNA 
fragment. 

r 

( *-) indicates the presumed initiation codon for each 
gene. 

( ^ ) indicates the first phenylalanine residue of the 
mature protein. 

(^^ ) indicates the termination codon of each gene. 

P78 and P79 are sense and antisense primers used 
for PCR amplification 

85A, -B, -C sequences used for the synthesis of 
specific synthetic oligonucleotides probes are framed. 

The indicated restriction sites have been used to 
prepare the three type-specific probes (see also Figure 
4A) . 
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B- Comparison of the pre-protein sequences of 
antigen 85A, B and C: 

DNA sequences have been aligned with the "Align" 
program which permits multiple alignments. In this 
presentation, sequence differences are outlined: 

(•) indicate identical residues ; (-) indicates a 
gap ; (any letter) indicates a sxibstitution. 

All the sequences are compared and aligned to that 
of the first line (gene 85A) . 

85A: Protein sequence from tuberculosis 
(Borremans L. et al,, 1989, "Cloning, sequence 
determination and expression of a 32-kilodalton protein 
gene of Mycobacterium tuberculosis " Infect. Immun. 
57:3123) . 

85B: Protein sequence from a-antigen of 
Mycobacterium bovis (strain Tokyo) (Mat suo K. et al., 
1988, "Cloning and expression of the Mycobacterium 
bovis BCG gene for extracellular a-antigen" J. 
Bacteriol. 170:3847), 

85C: Protein sequence from antigen 85C from 
Mycobacterium tuberculosis (the present invention) . 

85B-KAN: Partial protein sequence from antigen 85B 
from M^ kansasii (Matsuo K. et al., 1990, "Cloning and 
expression of the gene for cross-reactive a antigen of 
Mycobacterium kansasii " Infect. Immun. 58:550) . 

85C-BCG: Partial protein sequence from 
Mycobacterium bovis BCG strain 1173P2 (the present 
invention) . 

The "C" characteristic motif is framed. 
Figure 3: 

Figure 3 represents the hydropathy pattern of the 
M^ tuberculosis 32-kDa (antigen 85A) , the a-antigen of 
BCG (antigen 85B) and antigen 85C from M^ tuberculosis , 
amino acid secjuences: 

The sequence of the three pre-proteins (including 
the presumed signal peptide signals) have been analyzed 
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using the Kyte and Doolittle method (Borremans L. et 
al.^ 1989, "Cloning, sequence determination and 
expression of a 32-kilodalton protein gene of 
Mycobacterium tuberculosis " Infect. Immun. 57:3123) 
with a window of eight amino acids. Each bar on the 
axes represents 50 amino acids. Since the length of 
signal sequences are slightly different (43, 40 and 46 
residues for the three proteins 85A, 85B, 85C) the 
patterns are aligned to the first residue of the three 
mature proteins. Plain lines are used to align 
hydrophobic peaks and a dashed line to align 
hydrophilic peaks. 
Figures 4A and 4B: 

Figure 4A represents the restriction endonuclease 
maps of the three genes 85A, 85B and 85C: type-specific 
probes are marked by <--->. 

The map of gene 85A is derived from Borr et al. 
(Borremans L. et al., 1989, "Cloning, sequence 
determination and expression of a 32-kilodalton protein 
gene of Mycobacterium tuberculos is " Infect. Immun. 
57:3123). The map of 85B was obtained from clone 5.1 
derived from our Mycobacterium bovis BCG 1173P2 Agtll 
recombinant library (De Wit L. et al., 1990, 
"Nucleotide sequence of the 32 kDa-protein gene 
(antigen 85A) of Mycobacterium bovis BCG " Nucl. Ac. 
Res. 18:3995) . For the restriction enzymes used, this 
map is identical to that published for M^ bovis BCG 
(strain Tokyo) (Matsuo K. et al., 1988, "Cloning and 
expression of the Mycobacterium bovis BCG gene for 
extracellular a-antigen" J. Bacterid. 170 :3847) . The 
coding region of the 85B antigen is positioned 
according to Matsuo et al. (Matsuo K. et al., 1988, 
"Cloning and expression of the Mycobacterium bovis BCG 
gene for extracellular a-antigen" J. Bacterid. 
170:3847) . 
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The map of 85C corresponds to the restriction map 
of clone 11.2 that was obtained from the 
tuberculosis Agtll library from R. Young (Young R.A* et 
al., 1985, "Dissection of Mycobacterium tuberculosis 
antigens using recombinant DNA" Proc. Natl. Acad. Sci. 
USA 82.:2583) (Materials and Methods). The position of 
the specific 5 ' DNA restriction fragment used for 
Southern analysis is indicated on each map by a double 
arrow. 

Figure 4B represents the Southern analysis of the 
total genomic DNA from Mycobacterium bovis BCG (strain 
1173P2) : 

Fifteen /xg DNA of digested DNA was applied per 
lane. Hybridization was with oligonucleotide probes A, 
B, C (as described in Fig. 2A) under the conditions 
described in Materials and Methods. Molecular weight of 
the hybridizing bands were calculated by comparison 
with standards. 

Figure 4C represents the Southern analysis of 
total genomic DNA from M^ bovis BCG 1173P2. The 
procedure described for Figure 4B was used. 

The three probes, however, were large DNA 
restriction fragments (as defined in Figure 4A) . 

Parts 85A and 85B were obtained from a single 
filter, whereas 85C was from a separate run. 
Figure 5: 

Figure 5 represents the pulse field 
electrophoresis of Mycobacterium tuberculosis DNA: 

DNA from three strains of Mycobacterium 
tuberculosis was digested with Dra l and separated by 
Pulse field electrophoresis on an agarose gel together 
with a bacteriophage A DNA 'ladder' as described in 
Materials and Methods. After transfer to nylon filters, 
hybridization with the three probes 85A, 85B, 85C was 
as described under Fig. 4A. Molecular weights of the 
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hybridizing bands were calculated by comparison with 
those of the A DNA "ladder'. 

MjlTERIALS AND METHODS 

1. Preparation of genomic DNA (Thole J. et al., 1985, 
"Cloning of Mycobacterium bovis BCG DNA and expression 
of antigens in Escherichia coli " Infect. Immun. 
50:3800) : 

M. bovis BCG was cultivated at 37 *C in Sauton 
medium and harvested after an additional incubation of 
18 h in the presence of 1% glycine added at the end of 
the late exponential growth phase. The bacteria were 
treated with lysozyme and proteinase K, lysed with 
sodium dodecyl sulfate, phenol extracted and ethanol 
precipitated . 

2. Genomic libraries: 

A Agtll recombinant library constructed from 
genomic DNA of tuberculosis (Erdman strain) , was 

obtained from Young R.A. et al., 1985, "Dissection of 
Mycobacterium tuberculosis antigens using recombinant 
DNA" Proc. Natl. Acad. Sci. USA 82:2583. 

A second Agtll recombinant library was prepared 
with genomic DNA from M^ bovis BCG (De Wit L. et al., 
1990, "Nucleotide sequence of the 32 IcDa-protein gene 
(antigen 85A) of Mycobacterium bovis BCG " Nucl. Ac. 
Res. 18:3995) . 

3 . Oligonucleotides : 

Oligonucleotides were synthesized on an Applied 

Biosystems DNA synthesizer model 381A, purified on 

OPC-cartridges (Applied Biosystems) , lyophilized and 

dissolved in TE buffer (10 mM Tris-HCl, pH 7,4). 

^^P labeling of the oligonucleotides was as 

described in Sambrook J. et al., 1989, "Molecular 

Cloning: a Laboratory Manual" Cold Spring Harbor 

Laboratory, Cold Spring Harbor, N.Y. 
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4* PGR: 

50 ng of MycobacteriuiQ bovis BCG DNA was amplified 
in a 50-^1 reaction mixture containing 1 x PCR-buffer 
(Amersham) , 200 ^M dNTP, 1 fiK each of sense P78 (5«- 
CCGGAATTCATGGGCCGTGACATCAAG) and antisense P79 (5'- 
CC GGAATTC GGTCTCCCACTTGTAAGT ) oligonucleotide primers 
(the location of these two primers is indicated in 
Figure 2A. To both oligonucleotides were added an EcoRI 
sequence preceded by 3 additional nucleotides) , and 2 
units of Tag DNA polymerase • After denaturation for 90 
seconds at 94 'C the reaction was submitted to 4 0 cycles 
consisting of 1 minute at 93 (denaturation), 90 
seconds at 55 (annealing), 2 minutes at 72 *C 
(extension) , followed by a 5 minute final extension at 
72 "C. After extraction with 150 m1 chloroform, the 
amplified DNA was washed three times with 0.75 ml HgO 
in a Centricon-30 for 6 minutes at 6500 rpm in the 
Sorvall SS 34 rotor. After digestion with Eco RI the DNA 
was ligated into EcoR I -digested , phosphatase-treated 
Bluescribe-M13+ vector. DH5a coli (Gibco-BRL) were 
transformed and plated on Hybond-N filters. Colonies 
were selected by hybridization with ^^P-labeled 
oligonucleotide probe-A ( 5 ' -TCGCCCGCCCTGTACCTG) and 
oligonucleotide probe-B (5 « -TCACCTGCGGTTTATCTG) . 

Hybridization and washing conditions for the 
oligonucleotides were as described by Jacobs et al. 
(Jacobs et al., 1988, "The thermal stability of 
oligonucleotide duplexes is sequence independent in 
tetraalkyl ammonium salt solutions: application to 
identifying recombinant DNA clones" Nucl. Acid Res. 
16:4637) . 

5. Screening of the Agtll tuberculosis and 

Mycobacterium bovis BCG recombinant DNA libraries: 

The two Agtll recombinant libraries were screened 
by colony hybridization (Sambrook J. et al., 1989, 
"Molecular Cloning: a Laboratory Manual" Cold Spring 
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Harbor Laboratory, Cold Spring Harbor, N.Y.) with a 800 
bp Hind lll fragment of the previously cloned gene 85A 

(Borremans L, et al., 1989, "Cloning, sequence 
determination and expression of a 32-kilodalton protein 
gene of Mvcobacteriixm tuberculosis " Infect. Immun. 
57:3123) which does not discriminate gene 85A from 85B 

(see Fig. 2A and 4A) . Twelve positive tuberculosis 
and 12 Mycobacterium bovis BCG plaques were retained 
and screened by hybridization with ^^P-labeled 
oligonucleot ide-probe C ( 5 ' -TCGCAGAGCAACGGGCAGAACTAC ) 
as described above. 

From the tuberculosis Agtll library, one 

selected bacteriophage #11 was partially digested with 
EcoRI and its 5 kbp insert was subcloned in 
Bluescribe-M13+, From this recombinant plasmid named 
11-2, a 3,500 bp BamHI-EcoRI fragment was subcloned in 
M13-mpl8 and M13-mpl9 (Sambrook J. et al., 1989, 
"Molecular Cloning: a Laboratory Manual" Cold Spring 
Harbor Laboratory, Cold Spring Harbor, K.Y.). 

6. Recombinant DNA analysis: 

It was as described in Borremans L. et al., 1989, 
"Cloning, sequence determination and expression of a 
32-kilodalton protein gene of Mycobacterium 
tuberculosis " Infect- Immun. 57:3123. 

7. Sequencing: 

Sequence analysis was done by the primer extension 
dideoxy termination method of Sanger et al. (Sanger F. 
et al., 1977, "DNA sequencing with chain termination 
inhibitors" Proc. Natl. Acad. Sci. USA 74:5463) after 
subcloning of specific fragments in Bluescribe-M13+ 
(Chen E.J. et al., 1985, "Supercoil sequencing: a fast 
simple method for sequencing plasmid DNA" DNA _4:165) or 
in mpl8 and mpl9 M13 vectors. Sequence analysis was 
greatly hampered by the high GC content of the M. 
tuberculosis DNA (65%) . Sequencing reactions were 
therefore performed with several DNA polymerases 
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according to manufacturers protocols: T7 DNA polymerase 
("Sequenase" USB), T7 DNA polymerase (Pharmacia), and 
Tag DNA polymerase (Promega) using 7-deaza-dGTP instead 
of dGTP. Several oligodeoxynucleotides were synthesized 
and used to focus on ambiguous regions of the sequence. 
The sequencing strategy is summarized in Fig. 1. 

8. Sequence comparison and analysis: 

Routine computer-aided analysis of the nucleic 
acid and deduced amino acid sequences were performed 
with the LGBC program from Bellon B. , 1988, "Apple 
Macintosh programs for nucleic and protein sequence 
analysis" Nucleic Acid Res. 16:1837. Homology searches 
used the FASTA programs from Pearson W.R. et al., 1988, 
"Improved tools for biological sequence comparison" 
Proc. Natl. Acad. Sci. USA 85:2444, and the various DNA 
and protein data bank from the EMBL-server facilities. 
Multiple alignments were obtained with 'Align 1.01' 
(Scientific and Educational Software) . 

9. Southern blot analysis: 

Genomic DNA from Mycobacterium bovis BCG was 
completely digested with Sphl, EcoRI or Kpnl, 
electrophoresed on a 1% agarose gel, transferred to 
Hybond-N filter (Amersham) after denaturation and 
neutralization and either hybridized with ^^p.^abeled- 
oligonucleotide probes (A, B, C) in the conditions 
described in Jacobs et al., 1988, "The thermal 
stability of oligonucleotide duplexes is sequence 
independent in tetraalkylammonium salt solutions: 
application to identifying recombinant DNA clones" 
Nucl. Ac. Res. 1^:4637, or random-primed ^^p.^^j^^j^^ 
DNA restriction fragments that were found to 
discriminate the 3 genes 85A, 85B, and 85C. 

Probe 85A was a 230 bp PstI fragment from plasmid 
BY-5 (Borremans L. et al., 1989, "Cloning, sequence 
determination and expression of a 32-kilodalton protein 
gene of Mycobacterium tuberculosis " Infect. Immun. 
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57:3123 and Fig. 2A) . Probe 85B was a 400 bp Smal-EcoRV 
fragment from a 85B recombinant plasmid named 5, If 
derived from our Mycobacterium bovis BCG Agtll library, 
whose map is presented in Fig. 4A (see also Fig. 2A) . 
Probe 85C was a 280 bp Sma l -Kpn l fragment from plasmid 
11.2 (see also Fig. 4A and 2A) . 

These DNA fragments were prepared by gel 
electrophoresis on low melting point agarose followed 
by a rapid purification on Qiagen (marketed by: 
Westburg, Netherlands) (tip 5) according to 
manufacturers protocol and labeled in the presence of 
c^_32p.^icTP (Feinberg A. P. et al., 1983, "A technique 
for radiolabeling DNA restriction endonuclease 
fragments to high specific activity" Anal. Biochem. 
132 :6) . 

10. Pulse Field electrophoresis DNA separation: 
DNA preparation, restriction enzyme digestion and 
pulse-field gel electrophoresis were performed as 
described by Vincent Ijevy-Frebault V. et al., 1990, 
("DNA polymorphism in Mycobacterium para tuber cul os is , 
"wood pigeon mycobacteria" and related mycobacteria 
analyzed by field inversion gel electrophoresis", J. 
Clin. Microbiol. 27:2723) • Briefly cells from fresh 
cultures were mixed with 1% low-melting-point agarose 
(v/v) and submitted to successive treatments with 
zymolase (Seikagaki Kogyo, Tokyo, Japan), lysozyme, and 
sodium dodecyl sulfate in the presence of proteinase K 
(Boehringer GmbH, Mannheim, Germany) . After 
inactivation of proteinase K with phenylmethylsulfonyl 
fluoride (Bio-Rad Laboratories) , agarose blocks were 
digested overnight with 50 U of Dral (Bio-Rad 
Laboratories) . Then blocks were loaded into a 1% 
agarose gel prepared and electrophoresed in 0.66 TBE 
(Tris-boric acid - EDTA) . Field inversion gel 
electrophoresis was carried out using a Dnastar Pulse 
(Dnastar, USA) apparatus. Forward and reverses pulses 
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were set at 0.33 sec and 0.11 sec at the beginning of 
the run and 60 sec and 20 sec (or 30 sec and 10 sec) at 
the end of the run depending on the molecular weight 
zone to be expanded. The run time was set at 36 h, the 
voltage used was 100 V and producing about 325 mA and 
temperature was maintained at 18 'C. Lambda concatemers 
were used as molecular weight markers. At the end of 
the run, the gels were stained with ethidium bromide, 
photographed under UV light and transferred onto nylon 
membranes according to Maniatis T. et al., 1982, 
"Molecular cloning: a laboratory manual" Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y. 545 pp. 

RESULTS 

1. Cloning of the 85C gene of Kj^ tuberculosis; 

Since no specific probe or monoclonal antibody was 
available to detect specifically an 85C or related 
antigen which was expected to bear extensive homology 
to gene 85A and gene 85B, this screening required the 
development of a new procedure. The strategy used was 
based on the PGR amplification of a 245 bp DNA fragment 
coding for amino acids 18-98 of the mature antigen 85A 
chosen because it is surrounded at both ends by highly 
conserved DNA sequences when the sequences of antigen A 
and B are aligned (see primers P78 and P79 in Fig. 2A) . 
It was thus supposed that an equivalent homology might 
exist with the sequence of antigen 85C in the same 
region. 

From Mycobacterium bovis BCG genomic DNA, a 245 bp 
DNA fragment was readily obtained. The latter was 
purified and subcloned in a Bluescribe M13+ vector 
after digestion with EcoR I . About 80 recombinant 
plasmid-containing colonies were tested by plating on 
nylon filters and hybridized under stringent conditions 
with a labeled synthetic oligonucleotide recognizing 
either sequence 85A (5 '-TCGCCCGCCCTGTACCTG) or sequence 
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85B (5^-TCACCTGCGGTTTATCTG) within the PGR amplified 
fragment (see Fig. 2A) . Several clones that hybridized 
with each oligonucleotide probe were sequenced and the 
sequences were all identical to sequence 85A in the 
clones hybridizing with oligoprobe A and to sequence 
85B for those hybridizing with oligoprobe B. Several of 
the remaining clones were sequenced and they all showed 
a marked sequence divergence from 85A and 85B covering 
a 24-nucleotide stretch which is totally distinct from 
sequence A and B (Fig. 2A, box marked C) (The homology 
to sequence B is only 33% in this region) . Assuming 
these inserts might represent an amplified fragment of 
the 85C gene and that this 24 nucleotide sequence is 
characteristic of the putative 85C gene, an 
oligonucleotide probe (oligo 85C) based on this 
sequence was synthesized. 

The latter probe was labeled with ^^P and used to 
screen a collection of 24 Agtll recombinant phages that 
were selected in our tuberculosis and Mycobacterium 
hovis BCG Agtll libraries by hybridization with a 800 
bp non-specific Hindlll DNA fragment of the previously 
cloned gene 85A. 

One hybridizing Aqtll-M. tuberculosis recombinant 
was retained^ characterized by restriction mapping and 
sequenced. 

2. Sequence of the 850 gene of Mycobacterium 
tuberculosis ; 

The 1211 nucleotide sequence derived from various 
sequenced fragments is represented in Fig. 1. The DNA 
sequence contains a 1, 020-bp-long open reading frame ^ 
starting at position 150 and ending with a TGA codon at 
position 1170. The common NH2 terminal amino acid 
sequence of the antigen 85 proteins, Phe-Ser-Arg-Pro- 
Gly-Leu (De Bruyn J. et al., 1987, "Purification, 
partial characterization and identification of a 32 kDa 
protein antigen of Mycobacterium bovis BCG " Microb. 
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Pathogen- 2:351) could be located within this open 
reading frame from the nucleotide sequence beginning 
with a TTC codon at position 288 (Fig. 1) . Therefore, 
the DNA region upstream from this sequence is expected 
to code for a signal peptide required for the secretion 
of this antigen. The mature protein consists of 294 
amino acid residues corresponding to a calculated 
molecular weight of 32,021. 

Interestingly, the N-terminal sequence of the 
mature protein contains the entire 26 amino acid 
sequence (phe-ser-arg-pro-gly-leu-pro-val-glu-tyr-leu- 
gln-val-pro-ser-ala-ser-met-gly-arg-asp-ile-lys-val- 
gln-phe) described by Wiker H.G. et al., 1990, 
"Evidence for three separate genes encoding the 
proteins of the mycobacterial antigen 85 complex" 
Infect. Immun. 58:272, and which differs only from the 
common 85B and 85A sequence by an alanine instead of a 
proline in position 16 of the mature protein. Two ATG 
codons were found to precede the TTC phenylalanine 
codon at nucleotide position 288 (Fig. 1) in the same 
reading frame. Use of these two ATG would lead to the 
synthesis of signal peptides of either 21 or 46 amino 
acid residues (the latter situation has been 
represented in Fig. 1 for reasons indicated below) . 

The base composition of antigen 850 gene was 
identical to that of the 85A gene with an overall G-C 
composition of 64.57% and a strong preference for G or 
C in codon position 3 (average 85%) . In contrast to 
antigen 85A and 85B that contain 3 cysteins, the 
sequence of antigen 85C shows a single cystein residue 
at position 254. In fact, the two substituted cysteins 
are located in the region of the mature 85C protein 
which contains the largest divergent sequence bloc 
(Fig. 2B) (SQSNGQNY) (The corresponding DNA sequence 
was used to synthesize the oligonucleotide probe "C" 
(see above)). Not surprisingly, this hydrophilic region 
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is also the most divergent when the hydropathy plots of 
the 3 antigens are compared and thus could be either a 
variable "epitope" of all 85-antigens and/or a 
characteristic epitope of antigen 85C since it was also 
found in antigen BSC from bovis BCG (Figure 2B, 
fifth line) . 

Another characteristic feature of antigen 85C is 
the presence of the unusual hydrophobic repetitive 
proline alanine motive PPAAPAAPAA at the carboxy- 
terminal of the molecule. 

3. Hydropathy pattern: 

The hydropathy pattern of tuberculosis 85C 

antigen was determined by the method of Kyte and 
Doolittle (Kyte J. et al., 1982, "Simple method for 
displaying the hydropathy character of a protein" J . 
Mol. Biol. 157:105). The octapeptide profiles were 
compared to antigen 85A and 85B (Fig. 3) . As 
anticipated from the amino acid sequences, the patterns 
are roughly similar for the three antigens except for 
some major differences at region 84-92 and in the 
carboxy- terminal part of the three proteins. 

4. Sequence homologies: 

DNA sequences from antigen 85A (Borremans I*, et 
al., 1989, "Cloning, sequence determination and 
expression of a 32-kilodalton protein gene of 
Mycobacterium tuberculos is " Infect. Immun. 57:3123 ; De 
wit L. et al., 1990, "Nucleotide sequence of the 32 
JcDa-protein gene (antigen 8 5 A) of Mycobacterium bovis 
BCG " Nucl. Ac. Res. 18:3995), 85B (Matsuo K. et al., 
1988, "Cloning and expression of the Mycobacterium 
bovis BCG gene for extracellular a-antigen" J. 
Bacterid. 170 :3847 ; Matsuo et al. , 1990, "Cloning and 
Expression of the gene for cross-reactive a-antigen of 
M. kansasii " Infect. Immunity 58:550-556) and 85C were 
aligned. An alignment of the three DNA sequences is 
shown in Fig. 2A. At the DNA level, the homology is 
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maximal between the regions coding for the 3 mature 
proteins. In this region, the homology between A and B 
is 77.5% whereas it reaches only 70.8% between the 
coding regions of genes A and C and 71.9% between B and 
C, respectively. Beyond nucleotide 1369 of sequence 85A 
and upstream from nucleotide position 475 (i.e. within 
the signal secjuence and promoter region) there is 
practically no homology between the 3 sequences. No 
significant homology was detected to other DNA 
sequences present in the latest release of GenBank- 
EMBL. 

Homologies at the amino acid level, are presented 
in the alignment in Fig. 2B, again indicating a higher 
homology between sequences A and B (80.4%) than between 
B/C or A/C. 

Other comparisons between the 85C antigen and the 
entire SwissProt-NBRF data bank failed to detect any 
significant homologies to the 85C antigen amino acid 
sequence. As for the 85A antigen, the 850 sequence does 
not contain the RGD motif of fibronectin binding 
proteins nor does it share any homology to the known 
fibronectin receptors or to the fibronectin binding 
protein from Staphy 1 ococcus aureus. 

Comparison of the partial PCR derived DNA sequence 
of the 85C gene of bovis BCG 1173P2 with that of 
Mycobacterium tuberculosis shows complete identity 
including the characteristic region corresponding to 
synthetic oligonucleotide C (see Figure 2A) . 

5. Genome characterization: 

In order to confirm the existence of different 
genes coding for the antigen 85 complex bovis BCG 
genomic DNA was digested with Sph I, EcoR I and Kpn l and 
the distribution of radioactive signals was examined in 
Southern blot after hybridization with three specific 
oligonucleotide (A, B, C) probes (see Materials and 
Methods and Fig. 2A) . Three clearly distinct patterns 
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were obtained confirming the specificity of these 
probes. Similar type specific profiles could be 
obtained with three random-priming-labeled DNA 
restriction fragments (probe 85A, 230 bp; 85B, 400 bp; 
85C 280 bp) which were selected within the promoter 
signal sequence of the three DNAs (Fig. 2A and 4A) . 
With these three DNA restriction fragments, additional 
weak bands are also observed which clearly correspond 
to cross hybridization of the probes to the other two 
genes. With probe 85C, an additional Kpn l fragment was 
observed that does not hybridize to the C- 
oligonucleotide probe. This probably indicates that the 
corresponding Kpn l site is located upstream from this 
gene. Furthermore the size of the observed restriction 
fragments are not always exactly as expected from the 
restriction maps of the corresponding cloned genes. 
These discrepancies probably correspond to some minor 
sequence differences (restriction polymorphism) 
possibly in non coding DNA regions (outside of the DNA 
coding for the antigen 85) between strain of M^^ bovis 
BCG and the bovis BCG (strain Tokyo) and M. 

tuberculosis respectively. 

6. Pulse field analysis of tuberculosis genomic 

DNA: 

When the largest available 85A clone BY-5 was 
hybridized (Fig. 4A) with oligonucleotide probe B, no 
positive signal was detected whereas oligonucleotide 
probe A gave a positive hybridization (not shown) • This 
indicates that gene B is not located within 2-2.5 kb of 
the 5' and 4.0 kb of the 3' border of gene A (Fig. 4A) . 
To confirm and extend this result, pulse-field 
separated Dral-digested tuberculosis genomic DNA was 
further hybridized with the three specific DNA 
restriction fragments as probes (85A, 85B and 85C) 
under stringent conditions. 
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Eight strains of tuberculosis were compared 

showing six different patterns, three of which are 
illustrated in Fig. 5. For most strains examined, the 
three probes hybridized to fragments of different 
sizes. For instance, in tuberculosis H37Ra, the 

respective size of the Dra l fragments hybridizing with 
probes 85A, B and C were about 242 kb, 212 kb and 225 
kb for strain H37Ra, 403 kb, 212 kb and 104 kb for 
strain H37Rv and 355 kb, 104 kb and 153 kb for strain 
"1025". Although various strains show some restriction 
fragment length polymorphism with restriction 
endonuclease Dra l, the simplest interpretation of these 
results is that the three antigen 85 genes are 
distantly located (> 100 kb) within the mycobacterial 
genome • 



wo 92/14823 



PCr/EP92/00268 



50 
CLAIMS 

1, Nucleic acid 

* containing a nucleotide sequence extending from the 
extremity constituted by the nucleotide at position (1) 
to the extremity constituted by the nucleotide at 
position (149) represented on Figure 1, 

* or containing one at least of the nucleotide 
sequences coding for the following peptides or 
polypeptides : 

the one extending from the extremity constituted 
by amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted 
by amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- SQSNGQNY, or 

- PMVQIPRLVA, or 

- GLTLRTNQTFRDTYAADGGRNG, or 

- PPAAPAAPAA, 

* or containing nucleotidic sequences: 

- hybridizing with the above-mentioned nucleotide 
sequences or their complements, 

- complementary to the above-mentioned nucleotide 
sequences , or 

- which are the above-mentioned nucleotide 
sequences wherein T can be replaced by U. 

2. Nucleic acid according to claim 1, containing 
the nucleotide sequence coding for the following 
peptide: 

SQSNGQNY 

and possibly containing the nucleotide sequence coding 
for the following peptide 

FSRPGLPVEYLQVP 
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and liable to hybridize with the following nucleotide 
sequence: 

CGGCTGGGAC(or T) ATCAACACCCCGGC 

and liable to hybridize neither with 

GCCTGCGGCAAGGCCGGTTGCCAG 

nor with 

GCCTGCGGTAAGGCTGGCTGCCAG 
nor with 

GCCTGCGGCAAGGCCGGCTGCACG . 

3. Nucleic acid according to anyone of claims 1 or 

2, containing an open reading frame coding for a 
polypeptide 

- liable to react selectively with human sera from 
tuberculosis patients and particularly patients 
developing an evolutive tuberculosis, 

or liable to be recognized by antibodies also 
recognizing the amino acid sequence extending from the 
extremity constituted by amino acid at position (1) to 
the extremity constituted by amino acid at position 
(294) represented on Figure 1, 

- or liable to generate antibodies recognizing the 
amino acid sequence extending from the extremity 
constituted by amino acid at position (1) to the 
extremity constituted by amino acid at position (294) 
represented on Figure 1. 

4. Nucleic acid according to anyone of claims 1 to 

3, containing an open reading frame and coding for a 
mature polypeptide of about 3 0 to about 35 kDa and 
containing a sequence coding for a signal sequence. 

5. Nucleic acid according to claim 1, containing a 
nucleic acid coding for the following polypeptides: 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 
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- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
eimino acid at position (-21) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on Figure 
1- 

6. Polypeptide coded by the nucleic acids of 
anyone of claims 1 to 5* 

7. Polypeptide according to claim 6, containing in 
its polypeptide chain one at least of the following 
amino acid sequences: 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, 

- or the one extending from the extremity constituted 
by amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure ±, or 

- SQSNGQNY, or 

- PMVQIPRLVA^ or 

- GLTLRTNQTFRDTYAADGGRNG, or 

- PPAAPAAPAA. 

8. Polypeptide according to claim 7, containing in 
its polypeptide chain the following amino acid 
sequence: 
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SQSNGQNY 

and possibly the amino acid sequence 

GWDINTPA 

and possibly the amino acid sequence 

FSRPGLPVEYLQVP 
and containing not the amino acid sequence 

ACGKAGCQ 

and not the amino acid sequence 

ACGKAGCT 

9. Polypeptide according to anyone of claims 7 or 
8, liable to react selectively with human sera from 
tuberculosis patients and particularly patients 
developing an evolutive tuberculosis, 

or liable to be recognized by antibodies also 
recognizing the polypeptidic sequence extending from 
the extremity constituted by amino acid at position (i) 
to the extremity constituted by amino acid at position 
(294) represented on Figure 1, 

or liable to generate antibodies recognizing the 
polypeptidic sequence extending from the extremity 
constituted by amino acid at position (1) to the 
extremity constituted by amino acid at position (294) 
represented on Figure 1. 

10. Polypeptide according to anyone of claims 7 to 
9, Of about 30 to about 35 kDa and preceded by a signal 
peptide. 

11. Polypeptide according to claim 1, containing 
in its polypeptide chain one at least of the following 
amino acid sequences: 

- the one extending from the extremity constituted by 
amino acid at position (l) to the extremity constituted 
by amino acid at position (294) represented on Figure 
1, or 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
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constituted by amino acid at position (294) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position {-21) to the extremity 
constituted by amino acid at position (294) represented 
on Figure 1^ or 

- the one extending from the extremity constituted by 
amino acid at position (-46) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure 1, or 

- the one extending from the extremity constituted by 
amino acid at position (-21) to the extremity 
constituted by amino acid at position (-1) represented 
on Figure !• 

12. Amino acid sequences constituted by a 
polypeptide according to anyone of claims 6 to 11, and 
a protein or an heterologous sequence with respect to 
said polypeptide, said protein or heterologous sequence 
comprising from about 1 to about 1,000 amino acids, the 
heterologous protein being advantageously jS- 
galactosidase . 

13. Recombinant nucleic acid containing at least 
one of the nucleotide sequences according to anyone of 
claims 1 to 5, inserted in a heterologous nucleic acid. 

14. Recombinant vector, particularly for cloning 
and/or expression, comprising a vector sequence, 
notably of the type plasmid, cosraid, phage DNA or virus 
DNA, and a recombinant nucleic acid according to anyone 
of claims 1 to 5, in one of the non essential sites for 
its replication. 

15. Recombinant vector according to claim 14, 
containing in one of its non essential sites for its 
replication necessary elements to promote the 
expression of polypeptides according to anyone of 
claims 6 to 11 in a cellular host and notably a 
promoter recognized by the RNA polymerase of the 
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cellular host, particularly an inducible promoter anc3 
possibly a sequence coding for transcription 
termination signals and possibly a signal sequence 
and/ or an anchoring sequence. 

16. Recombinant vector according to claim 15, 
containing the elements enabling the expression by 
coli of a nucleic acid according to anyone of claims l 
to 5 inserted in the vector, and particularly the 
elements enabling the expression of the gene or part 
thereof of /5-galactosidase. 

17. Cellular host which is transformed by a 
recombinant vector according to anyone of claims 14 to 
16, and containing the regulation elements enabling the 
expression of the nucleotide sequence coding for the 
polypeptide according to anyone of claims 6 to 11 in 
this host. 

18. Cellular host according to claim 17, chosen 
from among bacteria such as coli . transformed by the 
vector according to anyone of claims 14 to 16, or 
chosen from among eukaryotic organism, transfected by a 
vector according to claims 14 or 15. 

19. Expression product of a nucleic acid expressed 
by a transformed cellular host according to anyone of 
claims 17 or 18. 

20. Antibody characterized by the fact that it is 
directed against a recombinant polypeptide according to 
anyone of claims 6 to 11. 

21. Nucleotidic probes, hybridizing with anyone of 
the nucleic acids according to anyone of claims i to 5 
or with their complementary sequences. 

22. Process for preparing a recombinant 
polypeptide according to anyone of claims 6 to ii 
comprising the following steps: 

- the culture in an appropriate medium of a cellular 
host which has previously been transformed by an 
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appropriate vector containing a nucleic acid according 
to anyone of claims 1 to 5, and 

- the recovery of the polypeptide produced by the 
abovesaid transformed cellular host from the abovesaid 
culture. 

23. Method for the in vitro diagnosis of 
tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis comprising 

- contacting a biological sample taken from a patient 
with a polypeptide according to anyone of claims 6 to 
11, under conditions enabling an in vitro immunological 
reaction between said polypeptide and the antibodies 
which are possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody 
complex which has been possibly formed. 

24. Method for the in vitro diagnosis of 
tuberculosis in a patient liable to be infected by M. 
tuberculosis , comprising the following steps: 

- contacting the biological sample with an appropriate 
antibody according to claim 20, under conditions 
enabling an in vitro immunological reaction between 
said antibody and the antigens of M. tuberculosis 
which are possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody 
complex which may be formed. 

25. Necessary or kit for an in vitro diagnostic 
method for tuberculosis in a patient liable to be 
infected by Mycobacterium tuberculosis according to 
claim 23 , comprising 

- a polypeptide according to anyone of claims 6 to 11, 

- reagents for making a mediiom appropriate for the 
immunological reaction to occur, 

- reagents enabling to detect the antigen/antibody 
complex which has been produced by the immunological 
reaction, said reagents possibly having a label, or 
being liable to be recognized by a labeled reagent. 
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more particularly in the case where the above-mentioned 
polypeptide is not labeled, 

26. Necessary or kit for an in vitro diagnostic 
method for tuberculosis in a patient liable to be 
infected by Mycobacterium tuberculosis according to 
claim 24, comprising 

- an antibody according to claim 20, 

- reagents for making a medium appropriate for the 
immunological reaction to occur, 

- reagents enabling to detect the antigen/antibody 
complexes which have been produced by the immunological 
reaction, with said reagents possibly having a label or 
being liable to be recognized by a label reagent, more 
particularly in the case where the above-mentioned 
antibody is not labeled. 

27. Immunogenic composition comprising a 
polypeptide according to anyone of claims 6 to 11, in 
association with a pharmaceutical ly acceptable vehicle. 

28. Vaccine composition comprising among other 
immunogenic principles anyone of the polypeptides 
according to claims 6 to 11 or the expression product 
of claim 19, possibly coupled to a natural protein or 
to a synthetic polypeptide having a sufficient 
molecular weight so that the conjugate is able to 
induce in vivo the production of antibodies 
neutralizing Mycobacterium tuberculosis , or induce in 
vivo a cellular immune response by activating M> 
tuberculosis antigen-responsive T cells. 

29. Peptides of claim 6, advantageously used to 
produce antibodies, particularly monoclonal antibodies 
and which have preferably the following amino acid 
sequences (referring to Figure 1) : 

3 8 H2N-DGLRAQDDYNGWDINTPAFE-COOH 57 
78 HsN-TDWYQPSQSNGQNYTYKWET-COOH 97 
174 HgN-ANSMWGPSS DPAWKRNDPMV-COOH 193 
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204 


HjN-RIWVYCGNGTPSDLGGDNIP- 


-COOH 


223 


235 


HzN-NQTFRDTYAADGGRNGVFNF- 


-COOH 


254 


250 


HgN-GVFNFPPNGTHSWPYWNEQL- 


■COOH 


269 


275 


H2N-DIQHVLNGATPPAAPAAPAA- 


•COOH 


294 


30 


Mycobacterium bovis 


BOG 


vaccine 



transformed by a recombinant DNA sequence coding for a 
polypeptide according to anyone of claims 6 to 11, and 
possibly comprising the promoter sequence of said 
polypeptide, and an epitope of a polypeptide sequence 
heterologous with respect to said polypeptide. 

31. Necessary or kit for the diagnosis of prior 
exposure of a sxibject to tuberculosis , with said 

necessary or kit containing a preparation of at least 
one of the polypeptides or of the peptides according to 
anyone of claims 6 to 12, with said preparation being 
able to induce in vivo, after being intradermal ly 
injected to a subject, a delayed type hypersensitivity 
reaction at the site of injection, in case the subject 
has had prior exposure to tuberculosis . 



wo 92/14823 



PCT/EP92/00268 



1/25 



m 



o 

Cvl 



O 



[I 

c 

o —] 

CO 



So 



CSI O 



q 



SUBSTITUTE SHEET 



wo 92/14823 



PCr/EP92/00268 



AGGTGTCCGG GCCGACGCTG AATCGTTAGC CAACCGCGAT 40 

CTCGCGCTGC GGCCACGACA TTCGAACTGA GCGTCCTCGG 80 

TGTGTTTCAC TCGCCCAGAA CAGATTCGAC CGCGTCGTGC 120 

GCAGATGAGA GTTGGGATTG GTAGTAGCT ATG ACG TTC 158 

Met Thr Phe 
-46 

TTC GAA CAG GTG CGA AGG TTG CGG AGC GCA GCG 191 
Phe 6Ju GJn l^eJ Arg Arg Leu Arg Sen AJe AJa 
-40 -35 

ACA ACC CTG CCG CGC CGC GTG GCT ATC GCG GCT 224 
Thr Thr Leu Pro Arg Arg VaJ AJa IJe AJa AJa 
-30 -25 

ATG GGG GCT GTC CTG GTT TAC GGT CTG GTC GGT 257 
Met: GJy AJa VaJ Leu l^aJ Tyr GJy Leu VaJ GJy 
-20 -15 

ACC TTC GGC GGG CCG GCC ACC GCG GGC GCA TTC 290 
Thr Phe GJy GJy Pro AJa Thr AJa GJy AJa _Phe 

-10 -5 -1 1 



TCT AGG CCC GGT CTT CCA GTG GAA TAT CTG CAG 323 

Sen Arg Pro Gly Leu Pro Val Glu Tyr Leu Gin 

5 10 

Figure 1 (con't 1) 
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GTG CCA TCC GCG TCG ATG GGC CGC GAC ATC AAG 356 
Val Pro Sen Ala Ser Met Gly Arg Asp He Lys 

' 15 20 

GTC CAG TTC CAG GGC GGC GGA CCG CAC GCG GTC 3B9 
Val Gin Phe Gin Gly Gly Gly Pro His Ala Val 

25 30 

TAC CTG CTC GAC GGT CTG CGG GCC CAG GAT GAC 422 
Tyr Leu Leu Asp Gly Leu Arg Ala Gin Asp Asp 
35 35 40 

TAC AAC GGC TGG GAC ATC AAC ACC CCG GCC TTC 455 
Tyr Asn Gly Trp Asp He Asn Thr Pro Ala Phe 

45 50 

GAG GAG TAC TAC CAG TCA GGG TTG TCG GTG ATC 488 
Glu Glu Tyr Tyr Gin Ser Gly Leu Ser Val He 

55 60 

ATG CCC GTG GGC GGC CAA TCC AGT TTC TAC ACC 521 
Met Pro Val Gly Gly Gin Ser Ser Phe Tyr Thr 
65 70 

GAC TGG TAT CAG CCC TCG CAG AGC AAC GGC CAG 554 
Asp Trp Tyr Gin Pro Ser Gin Ser Asn Gly Gin 
75 SO 

Figure 1 (con*t 2) 
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AAC TAG ACC TAG AAG TGG GAG ACC TTC CTT ACC 
Asn Tyr Thr Tyn Lys Trp Glu Thr Phe Leu ThP 
85 90 95 



587 



AGA GAG ATG CCC GCC TGG CTA CAG GCC AAC AAG 620 

Arg Glu Met Pro Ala Trp Leu Gin Ala Asn Lys 

100 105 

GGC GTG TCC CCG ACA GGC AAC GCG GCG GTG GGT 653 

Gly Val Ser Pro Thr Gly Asn Ala Ala Val Gly 
110 120 

CTT TCG ATG TCG GGC GGT TCC GCG CTG ATC CTG 686 

Leu Ser Met Ser Gly Gly Ser Ala Leu lie Leu 
125 130 

GCC GCG TAC TAC CCG CAG CAG TTC CCG TAC GCC 719 

Ala Ala Tyr Tyr Pro Gin Gin Phe Pro Tyr Ala 
135 140 

GCG TCG TTG TCG GGC TTC CTC AAC CCG TCC GAG 752 

Ala Ser Leu Ser Gly Phe Leu Asn Pro Ser Glu 

145 150 155 

GGC TGG TGG CCG ACG CTG ATC GGC CTG GCG ATG 785 

Gly Trp Trp Pro Thr Leu He Gly Leu Ala Met 

160 165 



AAC GAC TCG GGC GGT TAC AAC GCC AAC AGC ATG 
Asn Asp Ser Gly Gly Tyr Asn Ala Asn Ser Met 



170 



Fi gure 1 ( con ' t 3 



,175 



818 
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Figure 1 (con't 4) 

TGG GGT CCG TCC AGC GAC CCG GCC TGG AAG CGC 851 

Trp Gly Pro Ser Ser Asp Pro Ala Trp Lys Arg 

180 185 

AAC GAC CCA ATG GTT CAG ATT CCC CGC CTG GTC 884 

Asn Asp Pro Met Val Gin lie Pro Arg Leu Val 
190 195 



GCC AAC AAC ACC CGG ATC TGG GTG TAC TGC GGT 917 
Ala Asn Asn Thr Arg He Trp Val Tyr Cys Gly 
200 205 210 

AAC GGC ACA CCC AGC GAC CTC GGC GGC GAC AAC 950 
Asn Gly Thr Pro Ser Asp Leu Gly Gly Asp Asn 

215 220 



ATA CCG GCG AAG TTC CTG GAA GGC CTC ACC CTG 983 
He Pro Ala Lys Phe Leu Glu Gly Leu Thr Leu 
225 230 



CGC ACC AAC CAG ACC TTC CGG GAC ACC TAC GCG 1016 

Arg Thr Asn Gin Thr Phe Arg Asp Thr Tyr Ala 
235 240 

GCC GAC GGT GGA CGC AAC GGG GTG TTT AAC TTC 1049 

Ala Asp Gly Gly Arg Asn Gly Val Phe Asn Phe 
245 250 

CCG CCC AAC GGA ACA CAC TCG TGG CCC TAC TGG 1082 

Pro Pro Asn Gly Thr His Ser Trp Pro Tyr Trp 
255 260 265 



SUBSTITUTE SHEET 



wo 92/14823 



PCr/EP92/00268 



6/25 

AAC GAG CAG CTG GTC GCC ATG AAG GCC GAT ATC 1115 
Asn Glu Gin Leu Val Ala Met Lys Ala Asp He 

270 275 

CAG CAT GTG CTC AAC GGC GCG ACA CCC CCG GCC 1148 
Gin His Val Leu Asn Gly Ala Thr Pro Pro Ala 
280 285 

GCC CCT GCT GCG CCG GCC GCC TGA GCCAGCAAGC 1182 
Ala Pro Ala Ala Pro Ala Ala TER 
290 

CAGCATCGGC AGCAGCGCAA CGGCCAGCG 1211 



Figure 1 (con't 5) 
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